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Abstract: Hepatocellular carcinoma (HCC) remains a difficult disease to study even after a 
decade of genomic analysis. Patient and disease heterogeneity, differences in statistical 
methods and multiple testing issues have resulted in a fragmented understanding of the 
molecular basis of tumor biology. Some researchers have suggested that HCC appears to 
share pathways with embryonic development. Therefore we generated targeted hypotheses 
regarding changes in developmental genes specific to the liver in HCV-cirrhosis and 
HCV-HCC. We obtained microarray studies from 30 patients with HCV-cirrhosis and 
49 patients with HCV-HCC and compared to 12 normal livers. Genes specific to non-liver 
development have known associations with other cancer types but none were expressed in 
either adult liver or tumor tissue, while 98 of 179 (55%) genes specific to liver 
development had differential expression between normal and cirrhotic or HCC samples. 
We found genes from each developmental stage dysregulated in tumors compared to 
normal and cirrhotic samples. Although there was no single tumor marker, we identified a 
set of genes (Bone Morphogenetic Protein inhibitors GPC3, GREM1, FSTL3, and FST) in 
which at least one gene was over-expressed in 100% of the tumor samples. Only five genes 
were differentially expressed exclusively in late-stage tumors, indicating that while 
developmental genes appear to play a profound role in cirrhosis and malignant 
transformation, they play a limited role in late-stage HCC. 
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1. Introduction 

1.1. Background and Motivation 

Hepatocellular Carcinoma (HCC) ranks fifth among all cancers and third in mortality, accounting 
for hundreds of thousands of deaths per year. One-year survival rates remain less than 50% in the 
United States, despite advances in therapy [1]. HCC is the primary cancer morbidity evolving over 
decades in underlying hepatitis C (HCV) liver pathology in North America and Japan. HCC 
development is generally thought to be a multistep process resulting from hepatocyte turnover, chronic 
inflammation, regeneration, oxidative stress, DNA damage, and cirrhosis, as well as direct viral injuries. 
Unfortunately, the specific molecular mechanisms underlying hepatocarcinogenesis remain unclear. 

In the last ten years, microarray technology has been a powerful tool to study the molecular basis of 
disease. By measuring whole-genome transcript levels, expression patterns associated with liver 
dysfunction have been examined. However, HCC remains a difficult disease to study, with widely 
variable findings between studies and several proposed non-overlapping gene signatures [2-12]. This 
is likely due not only to the biological heterogeneity of HCC pathogenesis, but also reflects the varied 
clinical background of patients and variation in statistical technique. There are significant statistical 
challenges which plague the analysis and interpretation of microarray experiments. Differences in 
technique in every stage of data pre-processing have been demonstrated to dramatically affect the end 
results, including background correction [13], normalization [14,15], and probe set summarization [16]. 

In addition, traditional statistical approaches are not particularly well-suited to cancer genomic data. 
When simultaneously comparing many thousands of genes, multiple testing problems are considerable. 
Worse, because the assumption of independent tests is often violated, actual false positive rates can be 
much higher than estimated by standard methods [13-15]. This implies that, even using conservative 
multiple testing correction methods, cancer studies which generate thousands of significantly 
differentially expressed genes could have several hundred false positive results. To reduce this 
problem, we used a knowledge-driven approach, using what is already known about normal and 
disease processes to generate hypotheses that can be tested with a relatively small number of statistical 
tests [17-19]. 

Another difficulty stems from the heterogeneity of cancer processes, in which changes in the 
expression of important genes occur only in subsets of tumors. This results in skewed density curves 
(sometimes even bi-modal) that may not be easily detected by means-based tests. Most statistical tests 
in common use are based on comparing the magnitude of mean change relative to the variation. These 
tests also place focus on the largest magnitude changes which are often products of tumor behavior, 
such as increased metabolism and cell proliferation/turnover, rather than drivers that often have 
smaller fold-changes [20]. We suspect that there are modest changes in the expression of critical genes 
that may be difficult to distinguish from "noise" in the data, but may have a significant impact on 
tumor development [21,22]. To address this we applied techniques that find patterns in the data (such 
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as dimension reduction techniques), focused on such genes, to identify those that drive tumor behavior 
even if they have modest expression changes or skewed distribution patterns. 

1.2. Questioning Biological Randomness in HCV-HCC 

Tumors are widely believed to arise through an accumulation of random mutations. Mutations and 
chromosomal instability have been demonstrated in several carcinomas, including non-Hodgkins 
lymphoma, ovarian, colorectal, and oral tissue types [23-27]. HCC arising from chronic HBV 
infection has also been associated with mutations via HBx gene interference with p53 binding, leading 
to faulty DNA repair mechanisms [28,29]. It would be natural to think that the widespread 
dysregulation of gene expression in HCV-HCC is also largely random. However, HCV-HCC may be 
unusual because hepatitis C is an RNA virus that codes proteins that have direct interaction with over 
thirty host proteins. Tumors emerge from an environment of decades of host response to infection and 
liver damage. Therefore we hypothesize that induction of HCC in chronic HCV liver pathology may 
depend more on host response to chronic infection and HCV-host interactions than on direct DNA 
damage. If this is true, the effects of the HCV virus will be seen in the perturbation of the "tools at 
hand": gene expression changes that might be expected include modified expression of genes already 
in use in the liver (including genes expressed by activated hepatic stellate cells), target genes of host 
proteins that HCV proteins interact with, and genes used in the liver's own life history. Such genes 
contain the specific transcription factor binding sites (TFBS) that are responsive to the transcription 
factors expressed in the liver, while genes that are not normally expressed in the liver are responsive to 
different promoters. For instance, the promoter region for FGF7 (expressed in the embryonic liver) 
contains binding sequences for ATF2, FOXD1, HNF3B, STAT3, and JUN which are all expressed in 
the liver and dysregulated in liver disease. This reasoning also implies that genes never expressed by a 
healthy liver would not be expected to be activated by HCV-induced tumors to the same extent as in 
HBV-HCC or other cancers. 

To further target our hypotheses, we reviewed the current knowledge of processes involved in 
HCC. For instance, it has recently been noted that there appear to be pathways common to both cancer 
and embryonic development in HCC and other cancers [30,31]. In the context of the hypothesis of 
non-random response to HCV as described above, this led us to question whether any developmental 
genes involved in HCC are specific to liver development, and if paralog genes (similar in structure and 
function in other tissues) remain dormant. In this paper we demonstrate that HCV-induced liver 
cirrhosis and HCC do indeed show a general pattern of differential expression of liver development 
genes compared to paralog genes that have similar roles in the development of other tissues. Many of 
these developmental genes are up- or down-regulated in cirrhotic livers in a coherent way (clustering 
closely together), then degenerating into widely variable expression patterns in tumors. Some of the 
genes identified in this manner are already associated with HCC, while others appear to be novel. We 
also observed that some of these important embryonic signals are secreted from mesodermal tissues 
during development. These same signaling molecules may be secreted from mesodermally-derived 
stellate cells in adults. However, these cells comprise less than five percent of adult liver volume, 
which may result in an observed low signal that may have been difficult to distinguish from noise in 
previous studies. 
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1.3. Overview of Liver Development 

Liver development is a multi-stage process orchestrated by nearly 200 master regulators, growth 
factors, and their receptors. Growth factors secreted externally and from within the developing liver 
bind receptors on the surface of liver cells, which transduce signals to transcription factors (TFs) 
within the nucleus. These transcription factors, either individually or as co-factors, regulate a complex 
program of inducing or repressing access to gene transcription by a number of activities including 
chromatin decompaction, recruitment of chromatin remodeling complexes, and histone marker 
methylation, demethylation, or acetylation, as well as by physically blocking or recruiting RNA 
polymerase. For example, some of the earliest TFs that induce hepatic fate from the endoderm 
(FOXA1-3, GATA4/6) open the chromatin structure around early hepatic markers such as albumin and 
alpha fetoprotein (AFP), while [3-catenin (CTNNB1), LEF1, and TCF3 recruit chromatin remodeling 
complexes, and ARID5B and ATF2 modify histone markers of target genes. See Si-Tayeb et al. for a 
recent review [32] of embryonic liver development. 

In the neonatal period, many of these regulators are repressed in most parts of the liver, while others 
maintain some level of activity throughout adult life either playing roles in metabolism and 
homeostasis or in maintaining niches of hepatic stem cells. 

2. Results 

2.1. Description of Patient Population 

Microarray studies were obtained over a 10 year period from 180 samples of cirrhotic tissue and tumors 
collected from 140 patients with chronic HCV infection. As described in the Experimental Section, 
stringent quality control criteria were applied minimize technical artifacts, leaving us with 
30 HCV-cirrhosis (CIR) and 49 HCV-HCC (HCC) tumors. These were compared to a control group of 
12 samples taken from non-diseased, deceased donor livers (NOR). Tissue was obtained from 31 early 
HCC (stage Tl and T2) and 18 late HCC (T3 and T4). Twenty-nine HCC patients were transplanted, 
six died on the waiting list and 14 were never listed for transplant due to age, stage of cancer, or 
other co-morbidities. 

2.2. Differentially Expressed Genes Are Specific to Liver Development 

We hypothesized that genes not normally expressed in adult livers are less likely to be 
transcriptionally activated in HCV-HCC. To test this, we identified a set of genes with zero counts in 
an RNA sequencing study on a normal liver sample. 1,399 were represented on the Affymetrix 
U133Av2 genechip. We tested for expression in HCC compared to normal samples in our dataset 
using a one-sided Kolmogorov-Smirnov (K-S) test of identical distribution. Of the 1,399 genes, none 
were expressed in the HCC samples (a < 0.001). 

More specifically, we wished to determine whether the general developmental pathways altered in 
HCC were using genes specific to liver development, or whether any member of the gene family might 
be engaged. To examine this, we identified 26 paralog genes that have highly related developmental 
functions in other tissues that are not expressed in normal healthy livers (based on the RNA 
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sequencing data and verified with a literature search). In our data, no paralogs were expressed in HCC 
compared to normal samples (a < 0.001) (Supplementary Table SI, column A). We validated this to 
an independently collected HCV-HCC dataset from Wurmbach et al. (see Experimental Section), 
which also had no expression of these paralog genes (Supplementary Table SI, column B). Next, we 
compared the expression patterns of liver genes to their non-liver paralogs in tumors. Most (31/33, 
94%) of the liver genes had significantly different expression distributions compared to their paralog 
(Supplementary Table SI, column C). Figure 1 shows examples of the typically observed patterns of 
expression and poor regulation. Even when a liver gene is not differentially expressed in HCC, it often 
seems to be more poorly regulated with a wider distribution pattern than is seen in normal tissue. 

Figure 1. Selected density plots of liver development vs. paralog non-liver development 
genes. Expression densities are shown for gene pairs in normal and HCC samples from our 
data and in the Wurmbach dataset. Red = expression of the liver gene in HCC; 
Green = expression of liver gene in normal controls; Blue = expression of non-liver gene in 
HCC; Purple = expression of non-liver gene in normal controls. Paralog genes (RXRG and 
SOX1) were not expressed in HCC or normal samples, while liver development genes 
RXRA and SOX9 were differentially expressed in HCC. These patterns were also observed 
in the Wurmbach dataset. 

VCUHS WURMBACH 

RXRA RXRA 




Normalized signal intensity N = 54 Bandwidth = 0.2183 



Cancers 2012, 4 



950 



2.3. Patterns of Expression in HCV-Cirrhosis and HCV-HCC 

We then identified patterns of expression for liver development genes in HCV-CIR and HCV-HCC. 
To capture changes in either mean or variation, we assessed significance with a combined p-value from 
both t and F tests. We were particularly interested in whether the genes associated with progression from 
cirrhosis to cancer were those particular to a certain developmental stage. However, differentially 
expressed genes were found from all stages of development, and PCA plots of stage-specific genes all 
showed discrimination between normal, cirrhotic, and tumor samples (data not shown). 

Twenty-nine developmental genes had a pattern of higher magnitude over-expression in cirrhosis, 
then declining values in HCC (Table 1). These include several extra-cellular matrix (ECM) genes and 
members of the TGFp/BMP signaling pathway. Transcription factors following this expression pattern 
include SOX9, GATA6, HAND2, and IRS2. The only genes differentially expressed between early (Tl 
and T2) and late-stage (T3 and T4) tumors were EPCAM and tumor suppressor KLF6. Nine genes 
were down-regulated in cirrhosis and remained low in tumors: transcription factors FOXA1, FOXA2, 
GATA4, HNF1A and STAT3; growth factor receptors ACVR2B, RXRA; signaling molecule NRTN, and 
MMP15, which degrades intracellular matrix proteins. 



Table 1. Liver development genes with significantly higher expression in cirrhosis than 
tumor samples. Fold-changes are relative to normal samples. 









Mean 


Mean FC 


Mean FC 


Gene 


Gene Name 


Gene Function 


FC 
CIR 


Early 
HCC 


Late 
HCC 


EPCAM 


Epithelial cell adhesion molecule 


ECM 


14.8 


14.0 


5.7 * 


MMP7 


Matrix metalloproteinase 7 


ECM 


7.5 


6.1 * 


3.5 * 


KRT19 


Cytokeratin- 1 9 


Epidermal IF 


6.0 


2.9 * 


1.8 * 


MMP2 


Matrix metalloproteinase 2 


ECM 


5.4 


4.8 * 


3.0 * 


VIM 


Vimentin 


Mesenchymal IF 


5.8 


5.2 * 


4.8 * 


SOX9 


SRY-box 9 


TF 


4.9 


3.2 * 


2.9 * 


LAMA2 


Laminin alpha 2 


ECM 


4.4 


2.3 * 


1.9* 


FGFR2 


Fibroblast Growth Factor Receptor 2 


GF receptor 


4.3 


2.4 * 


1.8 * 


KLF6 


Kruppel-like factor 6 


TF 


3.9 


2.6 * 


1.8 * 


COL4A2 


Collagen IV alpha 2 


ECM 


3.9 


2.6 * 


2.3 * 


LAMB1 


Laminin beta 2 


ECM 


3.5 


2.8 * 


1.5 * 


ARID5B 


AT rich interactive domain 5B 


TF 


3.4 


1.8 * 


1.7 * 


FSTL3 


Follistatin-like protein 3 


GF antagonist 


3.4 


1.7 * 


1.5 * 


TGFB1 


Transforming growth factor, beta 1 


GF 


3.2 


2.1 * 


1.5 * 


SMAD7 


SMAD family member 7 


Signal 
transduction 


3.2 


1.9 * 


1.4 * 


CITED2 


CBP/p300-interacting transactivator 


TF 


2.8 


2.0 * 


1.7 * 


GATA6 


GATA binding protein 6 


TF 


2.8 


1.6 * 


0.9 * 


SFRP5 


Secreted frizzled-related protein 5 


Wnt inhibitor 


2.6 


1.8 * 


1.5 * 


ID3 


Inhibitor of DNA binding 3 


TF antagonist 


2.4 


1.7 * 


1.3 * 


LAMC3 


Laminin gamma 3 


ECM 


2.4 


1.9 * 


1.3 * 


HAND2 


Heart- and neural crest derivatives- 
expressed protein 2 


ECM 


2.2 


1.7 * 


1.4 * 
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Table 1. Cont. 









Mean 


"Ik M T7I /"""•< 

Mean * C 


"Ik M T7I /"""•< 

Mean h C 


Gene 


Gene Name 


Gene Function 


*c 


Early 


Late 








C1K 


HCC 


Tl/'/' 


NDN 


Necdin 


TF 


2.2 


1.3 * 


1.3 * 


PTN 


Pleiotrophin 


GF 


2.1 


1.6 * 


1.4 * 


r7r\ rpi-» r\ f\ 

ZBTB20 


Zinc linger and BTB domain containing 20 


TF 


2.1 


1.4 * 


1.3 * 


CDH1 


Cadherin 1 


ECM 


1.8 


1.5 * 


1.5 * 


FGF7 


Fibroblast growth factor 7 


GF 


1.7 


1.3 * 


1.2 * 


BMP2 


Bone morphogenic protein 2 


GF 


1.6 


1.4 * 


1.3 * 


COL4A4 


Collagen IV alpha 4 


ECM 


1.6 


1.2 * 


1.2 * 


CSNK1D 


Casein kinase I isoform delta 


kinase 


1.5 


1.1 * 


1.0 * 


IRS2 


Insulin receptor substrate 2 


GF receptor 


1.4 


1.1 * 


1.1 * 



* denote genes that are differentially expressed compared to cirrhosis (a < 0.001). Abbreviations: 
TF = transcription factor; ECM = Extra-cellular matrix; IF = intermediate filament; GF = growth factor. 



Sixteen genes were differentially expressed uniquely in the tumors (Table 2). Magnitude of change was 
modest, however six of the 16 genes were transcription factors or activators (TBX3, HHEX, ATF2, 
FOXM1, PROX1 and STAT3) which might be expected to induce large effects with small expression 
changes. Thirty-five more genes were differentially expressed in cirrhosis and either had similar 
expression in tumors (25 genes) or had larger magnitude changes in HCC (10 genes). For example, 
GPC3 was up-regulated slightly in cirrhosis (x2), and greatly up-regulated in both early ( x 7.2) and late 
(xl0.4) tumors. 



Table 2. Genes uniquely changed in HCC (a < 0.001). 



Gene 


Gene Name 


Gene 
Function 


FC Early 
HCC 


FC Late 
HCC 


DKK1 


Dickkopf-related protein 1 


Wnt inhibitor 


3.8 


1.8 


MMP1 


Matrix metalloproteinase 1 


ECM 


2.4 


1.5 


FST 


Follistatin 


GF antagonist 


1.9 


2.1 


TBX3 


T-box 3 


TF 


1.3 


2.2 


MAP4K4 


Mitogen-activated protein kinase kinase kinase kinase 4 


kinase 


1.3 


1.3 


INHBA 


Activin 


GF 


1.3 


1.3 


HHEX 


Hematopoietically expressed homeobox 


TF 


1.3 


1.3 


ATF2 


Activating transcription factor 2 


TF 


1.3 


1.4 


BSG 


Basigen 


ECM receptor 


1.3 


1.4 


LAMA4 


Laminin alpha 4 


ECM 


1.2 


1.3 


FOXM1 


Forkhead box Ml 


TF 


1.2 


1.1 


KRAS 


v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog 


GTPase 


0.9 


0.7 


PROX1 


Prospero homeobox 1 


TF 


0.7 


0.5 


TGFBR3 


Transforming Growth Factor beta receptor 3 


GF 


0.7 


0.6 


MST1 


Macriogage stimulating 1 (hepatocyte growth factor-like) 


GF 


0.6 


0.6 


STAT3 


Signal transducer and activator of transcription 3 


TF 


0.6 


0.5 



Abbreviations: TF = transcription factor; ECM = Extra- cellular matrix; GF = growth factor. 
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Overall, 98 of the 179 (55%) genes critical to liver development had altered expression patterns in 
cirrhosis and early stage tumors. Only five of the genes were uniquely changed in late-stage tumors: 
ACVR1, HMGA2, IGF2, CP and YAP1. A complete list of all 179 liver development genes tested, and 
relative expression in each disease group can be found in Supplementary Table S2. 

2.4. Functional Gene Sets That Discriminate between Normal, Cirrhosis, and Tumor Samples 

From these significant liver development genes we have been able to identify some specific 
functional groups. Genes related to extra-cellular matrix (ECM) maintenance or remodeling 
demonstrated major changes in both cirrhosis and tumors. PCA of the significantly changed genes 
demonstrate that these genes also independently discriminate between normal, cirrhosis, and tumor 
samples (Figure 2A). The BMP signaling pathway is also highly dysregulated in HCV-cirrhosis and 
HCC. BMP2, its receptors, and BMP inhibitors are all differentially expressed in cirrhosis and HCC. 
Embryonically, BMP signaling is antagonistic to FGF signaling and this balance is controlled by the 
DAN family of BMP antagonists from mesenchymal cells and GPC3 expressed by hepatocytes. 
BMP2, BMPR1A, FGF7, FGFR2 and ID3 were more highly expressed in cirrhosis than HCC, while 
the BMP inhibitors were more highly expressed in tumors. At least one of the inhibitors GPC3, 
GREM1, FSTL3, and/or FST were over-expressed (FC > 1.5) in 100% of tumor samples 
(Supplementary Table S3). A PCA plot demonstrates the gene set's ability to discriminate most tumors 
from normal and cirrhosis samples (Figure 2B). Leave-one-out cross-validation (LOOCV) of ROC 
curves assessed predication sensitivity of 95.9% and specificity of 83.3%. 

Figure 2. PCA plots of (A) ECM genes and (B) BMP2 and its receptors and inhibitors. 
Green = normal control livers; Blue = cirrhosis; Red = early stage HCC; Black = late stage 
HCC. Normal tissues cluster well away from either cirrhosis or tumors. Both the ECM 
genes (A) and BMP inhibitors (B) were able to discriminate between cirrhosis and many of 
the tumor tissues. 



(A) Extra-cellular matrix genes 
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Figure 2. Cont. 



(B) BMP2, BMP receptors and inhibitors (GREM1, FST, FSTL3, GPC3) 
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Wurmbach data 



o 
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-2 



2 



4 



-1 



0 



2 



PC 1 



PC1 



These findings were validated against the Wurmbach dataset by applying the PCA loadings 
generated from our data to their dataset, and we observed similar patterns of separation between 
normal, cirrhosis, and HCC tissues (Figure 2A,B). The ECM and BMP genesets were well described 
with the first two principal components. 

3. Discussion 

Hepatocellular carcinoma is widely recognized as a highly heterogenous disease, which has made it 
difficult to characterize. However, because HCV is RNA virus that exerts its effects by direct protein 
interactions and host response to chronic infection and inflammation, we questioned the common 
perception that the gene expression changes observed in tumors are likely due to random activation of 
genes arising from mutations and DNA damage. We screened a total of 1,425 genes that were 
demonstrated by RNA seq analysis to have no expression in healthy liver. None of them were 
expressed in our HCV-HCC samples. This data suggests that the progression from cirrhosis to HCC in 
patients with chronic HCV infection is not accompanied by random gene activation, as has often been 
supposed. In particular, 26 of these genes have developmental roles highly paralogous to liver 
development genes and known association with carcinoma of other tissues, but remain under tight 
transcriptional control in both cirrhotic and HCC tissues. For example, BMP 3 inactivation is associated 
with colorectal cancer [33], gastric cancer [34], and pancreatic cancer [35]; CDH3 is over-expressed in 
several types of cancer including breast [36] and pancreatic [37]; FGF3 is amplified in ovarian [38], 
breast [39], and bladder cancer [40]; FGF12 is amplified in esophageal squamous cell carcinoma [41] 
and squamous cell carcinoma of the lung [42], and GATA1 mutations have recently been associated 
with acute megakaryoblastic leukaemia [43] and breast cancer [44]. 

Using a knowledge-driven approach, we have shown that genes critical to every stage of liver 
development are differentially expressed in HCV-cirrhosis and further deregulated in tumors. In 
general we observed a pattern of modest changes in genes differentially expressed in cirrhosis and 
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HCC. Mean fold-change in tumors was <2x up or down in many cases. Some of these genes are 
expressed by mesodermally-derived tissues in the embryonic liver and thus may also be expressed 
exclusively by the activated hepatic stellate cells (HSCs) in the adult liver. Since these cell types make 
up less than five percent of the liver volume in HCC, even large changes in gene expression from the 
HSC would result in modest overall signal change in a tissue sample. BMP2, BMP4, BMPR1A, HGF, 
LHX2, FST, SOX9, GREM1, FGF7, TGFB1, MMP2, TIMP2, SMAD7, KLF6 and JUN have 
experimentally validated expression in activated hepatic stellate cells [45-49]. All of these genes 
except BMP4, HGF, and LHX2 were differentially expressed in our study. As predicted, most of these 
have modest but significant mean fold-change. 

The major pattern that emerged from our detailed analysis of these important genes was the BMP 
pathway. Embryonically, BMP2 and BMP4 are secreted from the mesenchymal STM and act 
antagonistically against FGF2 and FGF4 to regulate rate of proliferation. Later in development, FGF7 
and FGFR2 promote differentiation into biliary epithelial cells. In healthy adults, BMP2 is secreted 
from hepatic stellate cells to suppress hepatocyte proliferation. During normal liver regeneration (such 
as after partial hepatectomy), BMP2 is down-regulated and hepatocyte proliferation is activated by at 
least four distinct and redundant mechanisms: FGF7 and FGFR2 [50]; KDR (aka VEGFR2), ID1, 
HGF, and WNT2 [51]; TNF, NFKB1, STAT3, and IL6 [52]; and p-catenin and Cyclin Dl [53]. During 
active hepatitis C infection, the HCV core protein induces over-expression of BMP2, which then 
participates in the activation of hepatic stellate cells and also functions to suppress hepatocyte 
proliferation in response to liver damage. In our dataset, late-stage cirrhotic livers had elevated levels 
of BMP2, its receptor, and some of its downstream effectors (SMADs and ID transcription factors). 
The proliferation promoters noted above were generally unchanged in cirrhotic tissue, while TNF and 
P-catenin were under-expressed. FGF7 and FGFR2, however, were over-expressed in both cirrhosis 
and HCC. This suggests a complex and somewhat chaotic response to active HCV infection and liver 
damage in the cirrhotic liver. However, many cirrhotic tissues and every tumor in our samples 
over-expressed at least one of the BMP inhibitors (FC > 1.5, see Supplementary Table S3), with higher 
expression in tumors. This data suggests there is a chronic struggle in the cirrhotic liver to maintain 
homeostasis of the BMP pathway, and that tumors might have found a way to overcome the 
proliferative inhibition imposed by BMP2 by sufficiently up-regulating any of the BMP inhibitors. In 
the tumors, most of the proliferation promoters had expression values similar to that of normal tissue 
(which is sufficient to induce proliferation in the absence of active BMP2), including those that were 
down-regulated in cirrhosis. 

Several of the developmental genes expressed in adult livers have been shown to have roles in 
maintaining tissue differentiation or regulating cell proliferation. BMPs and their DAN family 
antagonists have opposing effects on Wnt signaling and the BMP-GREM-Wnt circuit has been 
proposed as a mechanism to maintain stem cell niches in the colon [54]. FOXA1, FOXA2, HLX and 
SFRP5 are expressed in progenitor cells, while SOX9, SOX17, FOXA2 and KIT have confirmed 
expression in hepatic stem cells [55]. FOXA1-3 drive differentiation of hepatocyte stem cells. Merlin 
(NF2) has recently been proposed as a regulator of liver stem cells, with deletion leading to HCC in rat 
models [56]. In our data, FOXA1, FOXA2, MST1 and NF2 were down-regulated, suggesting that in this 
population there is a general pattern of de-differentiation of hepatocytes within tumors and acquisition 
of stem-cell-like properties. 
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4. Experimental Section 

4.1. Study Population 

Since 1997, HCV patients diagnosed with cirrhosis and HCC have been evaluated and treated at the 
Hume-Lee Transplant Center at VCUHS according to an Institutional Review Board approved study 
protocol [57]. Informed consent was obtained from all patients. After staging, HCC patients had their 
tumors ablated and were evaluated for liver transplant according to the United Network for Organ 
Sharing criteria. Tissue samples were collected from biopsies and explanted livers according to 
protocols established by the Liver Tissue Cell Distribution System (Richmond, Virginia, funded by 
NIH Contract #N01-DK-7-0004/HHSN267200700004C). Control liver samples were obtained from 
explanted donor livers. Donor livers were shown to have normal function and were negative for 
hepatitis C virus antibodies. 

An independently published dataset of HCV-cirrhosis and HCC was also obtained for verification 
of results [58] (Wurbach et al, NCBI GEO database accession GE06764). In addition, the absolute 
expression levels of target genes in a normal adult human were obtained from the BodyMap gene 
expression database [59] and mapped with Burrows- Wheeler Alignment tool (BWA) [60] with default 
parameters. In cases where BodyMap results were inconclusive (counts of 0-40), a literature search 
was performed to confirm adult expression of target genes. 

4.2. Sample Preparation 

Pre-transplant biopsies and explanted livers were sectioned and grossly examined. Samples from 
tumors and cirrhotic liver tissue (according to diagnosis and pathological examination) were freshly 
snap-frozen and processed in the Hume-Lee Transplant Center Molecular Diagnostic Laboratory. 
Liver tissue samples were collected in liquid nitrogen or RNAlater solution (Ambion, Austin, TX, 
USA) and stored at -80 °C until use. Explanted livers were sliced at intervals of 4-5 mm, and all 
nodules suspicious for HCC processed for light microscopy. Only tumor samples with more than 85% 
tumor cell content were used for the microarray studies. Normal and necrotic tissues were 
macro-dissected from the sample. 

With minor modifications, the sample preparation protocol follows the Affymetrix GeneChip 
Expression Analysis Manual (Affymetrix, Santa Clara, CA, USA). After hybridization and scanning, 
the microarray images were checked for major chip defects or abnormalities in the hybridization 
signal. Total RNA quality and integrity of each sample were analyzed using the Agilent 2100 
Bioanalyzer (Agilent Technologies), and products of cDNA synthesis and in vitro transcription (IVT) 
were tested before being considered for microarray analysis using the Agilent 2100 Bioanalyzer 
(cDNA synthesis 1.5 kb < cDNA < 5.0kb; IVT l.Okb < cRNA < 4.5 kb). 

All chips in the study were examined with several quality control tests [61,62]. Any chip that fell 
well outside the recommendations for any of the quality assessment tests was excluded from 
further analysis. 
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4.3. Statistical Methods 

Data files were read into the R (version 2.13) programming environment using updated probe 
annotations from the BrainArray project (version 14.1.0, HGU133A2_Hs_REFSEQ), which have been 
shown to improve accuracy of probe — gene mapping over the standard Affymetrix annotation [63]. 

Robust Multichip Average (RMA) pre-processing is broadly accepted as robust, easy to implement, 
and widely applicable. However, we were concerned that the assumptions of RMA may not be met 
(that only a small proportion (1-5%) of genes are differentially expressed, and that about the same 
number of genes are over- vs. under-expressed). Therefore, we processed a test dataset using RMA and 
carefully examined differential expression results. Comparison of group contrasts identified 25-45% 
genes that were called as differentially expressed at an FDR < 0.05, and 2/3 of those genes were 
over-expressed in tumors. This suggested to us that quantile normalization may not be an appropriate 
method for this dataset. Instead, we first performed background correction, then normalized the data 
using non-parametric, distribution-free regression on technical covariates of probes (GC content, 
melting temperature, and probe location) to estimate and correct for systematic bias [64]. 

We used scaled Principal Components Analysis (PCA) on specific gene sets to explore the relationship 
between those genes and disease behavior. We used the function "prcomp" with scale = TRUE in the 
R environment [65]. 

Normally regulated genes generally have a narrow distribution with small variation, while poorly 
regulated genes may have broad, flat distribution of expression values or long thick tails and high 
variability. Genes that are differentially expressed in a subset of tumor samples will have a skewed 
distribution. Either of these situations violate the assumption of normal distribution and can be difficult 
to identify with a t-test. Instead, we used F-test of variance to identify high-variance genes and the 
non-parametric two-sample Kolmogorov-Smirnov Test to test differences in both location and shape of 
the distributions. 

4.4. Identification of Test Gene sets 

In this study, we investigated the hypothesis that genes critical to the development of the liver are 
poorly regulated or preferentially activated in HCV-cirrhosis and HCV-HCC. There are five main 
stages of liver development: hepatic fate specification, hepatoblast migration, liver bud growth, 
hepatocyte/biliary differentiation, and maturation. We identified 179 genes (via literature review) that 
are experimentally determined to be critical regulators necessary for normal liver development (See 
Supplementary Table S2 for a complete list). We also wished to determine whether liver development 
genes are more likely than their non-liver paralogs (which have highly similar functions in the 
development of other tissue types) to be dysregulated in liver tumors. We identified 26 such genes that 
are not normally expressed in either embryonic or adult liver tissue (Supplementary Table SI). 

5. Conclusions 

In summary, there is growing awareness that hepatocarcinogenesis shares many features with liver 
organogenesis. Recent reviews have compared the pathways that appear to be involved in both 
development and cancer [66,67]. We show here, for the first time to our knowledge, that tumors 



Cancers 2012, 4 



957 



dysregulate the genes specific to their own development rather than closely related genes in the same 
pathways. We further demonstrate that these closely related paralogs, which have established roles in 
cancers of other tissues, remain under good transcriptional control in HCV-induced cirrhosis and HCC 
even in late-stage tumors. We have found that the earliest signals responsible for guiding the initiation 
and development of the embryonic liver are correlated with the development of cirrhosis and HCC due 
to chronic Hepatitis C infection, but do not appear to be involved in the progression from early to late 
stage cancer. These observed patterns of expression were validated in an independently collected 
HCV-HCC microarray experiment. By applying what is known about the control of these genes in 
normal liver development and their aberrant control in HCC, we can begin to model knock out and 
deletion experiments to further define how these pathways can be interrupted or stimulated to impact 
HCC occurrence or recurrence. 
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Supplementary Material 



Table SI. Liver development genes compared to their non-liver paralogs. A: one-sided K-S 
test of identical distribution comparing HCC to normal samples; B: on-sided K-S test in the 
Wurmbach dataset; C: K-S test comparing the liver development gene to it's non-liver paralog. 
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Table S2. All liver development genes tested and their relative expression in HCV-cirrhosis, 
HCC-cirrhotic tissues, early stage HCC and late stage HCC. Fold-change (FC) is relative to 
normal controls. 



Probe Set ID 


Gene name 


Mean FC HCV-CIR 


Mean FC Early HCC 


Mean FC late HCC 


NM_001105_at 


ACVR1 


1.01 


1.09 


0.90 


NM_004302_at 


ACVR1B 


0.92 


0.86 


0.87 


NM_001616_at 


ACVR2A 


0.82 


0.96 


0.96 


NM_001106_at 


ACVR2B 


0.50 


0.58 


0.59 


NM_001663_at 


ARF6 


1.18 


1.11 


1.11 


NM_032199_at 


ARID5B 


3.62 


2.13 


1.86 


NM_001880_at 


ATF2 


0.93 


1.27 


1.41 


NM_006856_at 


ATF7 


0.92 


0.93 


0.96 


NM_006395_at 


ATG7 


0.88 


0.91 


0.97 


NM_001200_at 


BMP2 


1.56 


1.42 


1.32 


NM_001202_at 


BMP4 


0.77 


1.02 


1.09 


NM_004329_at 


BMPR1A 


1.38 


1.37 


1.32 


NM_001203_at 


BMPR1B 


0.97 


1.01 


1.05 


NM_001204_at 


BMPR2 


1.00 


1.09 


1.02 


NM_001728_at 


BSG 


1.28 


1.23 


1.31 


NM_014333_at 


CADM1 


1.60 


1.68 


1.89 


NM_053056_at 


CCND1 


1.38 


1.34 


0.81 


NM_001759_at 


CCND2 


1.16 


1.16 


1.22 


NM_057749_at 


CCNE2 


1.17 


1.81 


2.16 


NM_004360_at 


CDH1 


1.74 


1.49 


1.41 


NM_004364_at 


CEBPA 


0.61 


0.93 


0.95 


NM_005454_at 


CER1 


0.81 


0.89 


0.99 


NM_006079_at 


CITED2 


2.74 


1.93 


1.63 


NM_001845_at 


COL4A1 


5.29 


4.22 


4.07 


NM_001846_at 


COL4A2 


3.84 


2.59 


2.30 


NM_000091_at 


COL4A3 


1.10 


1.04 


1.01 


NM_000092_at 


COL4A4 


1.54 


1.16 


1.24 


NM_000495_at 


COL4A5 


1.38 


1.79 


1.42 


NM_001847_at 


COL4A6 


0.93 


0.95 


0.95 


NM_000096_at 


CP 


0.90 


0.87 


0.66 


NM_001893_at 


CSNK1D 


1.47 


1.05 


1.09 


NM_001904_at 


CTNNB1 


0.86 


1.18 


1.05 


NM_012242_at 


DKK1 


1.07 


3.86 


1.75 


NM_001422_at 


ELF5 


0.89 


0.97 


1.01 


NM_002354_at 


EPCAM 


14.99 


14.35 


5.72 


NM_004448_at 


ERBB2 


1.37 


1.28 


1.37 


NM_000800_at 


FGF1 


1.10 


1.02 


1.07 


NM_002006_at 


FGF2 


0.98 


0.99 


0.93 


NM_002009_at 


FGF7 


1.54 


1.28 


1.17 


NM_006119_at 


FGF8 


0.99 


1.09 


1.09 


NM_015850_at 


FGFR1 


1.11 


1.04 


0.98 



Cancers 2012, 4 



964 



Table S2. Cont. 



D rn kp Caf Tit 


vrcllc fldlllc 


\ i.,.,n vc i \( '\ cm 


\ I.,., n VC F 'j i-l \r ncc 
IVlctlll r V r.dl 1 1 V V 


Mean VC \ati> IK T 


mm ooo 1 a 1 Qt 


FfTPR 9 
r yjr ivz 


4 99 

4.ZZ 


9 38 


1 68 


mm 009096 Qt 


FM1 
r in 1 


1 08 


1 1 7 
1.1/ 


1 98 
l.zo 


MM 0044Q6 Qt 


POYA 1 


0 41 
u.4 1 


0 S3 
U.J J 


0 49 

U.4Z 


MM 09 1 784 at 
1N1V1 UZ 1 / 84 dl 


COY A 9 
FUAAZ 


0 £1 

u.o 1 


u.o / 


U.OO 


MM 909009 at 
INlYi ZUZUUZ dL 




0 87 
U.o 1 


1 04 
1 .U4 


1 36 
1 .JO 


mm oo63so at 

1NI1V1 UUOj JU dl 




1 39 
1JZ 


9 08 
Z.Uo 


1 QS 

1 .yJ 


MM 00S860 Qt 
1N1V1 UUJouU dL 


F<iTT 3 


3 3Q 

j. jy 


1 f\A 
1 .D4 


1 S7 
1 . J / 


mm nnm s 1 Qt 

IN 1V1 UUU 1 J 1 dl 


n6pr" 

UOrL 


1 S3 


1 30 
1 .JU 


1 1 3 
1 . 1 J 


mm 0090S9 Qt 

1N1V1 UUZUJZ dl 


n AT A4 
VJ/A 1 /\4 


0 67 
U.O / 


0 7S 
U. / J 


0 81 
U.o 1 


mm oos9S7 at 

rNlVl UUJZJ / dL 


KJr\ 1 /\0 


9 84 

Z. o4 


1 A7 
l.O/ 


0 Q1 
\).y 1 


mm nn 140 s ; Qt 

1N1V1 UU14yj dl 


ur IvAZ 


1 09 
1 .UZ 


1 01 
1 .u 1 


1 01 
1 .U 1 


MM 004484 at 

INlYi UU44o4 dL 




9 07 
Z.U / 


7 77 


10 13 
1 U. 1 J 


mm 009086 Qt 

INIYI UUZUoO dL 


vjiivdz 


0 86 
U.oO 


0 Q4 


0 Q6 
u.yo 


MM D1 3379 Qt 
IN 1V1 Ul JJ /Z dL 


UIvizlVl 1 


1 9S 

1 -ZJ 


3 AA 
J. 00 


9 0Q 
z.uy 


MM 09 1 Q73 Qt 
IN 1V1 UZ17/J dL 


W AMF19 
HILrYlN 1JZ 


9 93 

Z.Zj 


1 71 
1 . / 1 


1 3S 
1 .J J 


MM 004404 Qt 
1NI1V1 UU44V4 dl 




1 OA 

1 .uo 


1 1 A 
1.10 


1 9S 

1 .ZJ 


MM 014S71 Qt 
INIYI U14J / 1 dL 


III, 1 1 * 


1 0Q 


1 1 A 

1.10 


1 1 3 
1 . 1 J 


mm 000601 Qt 

1N1V1 UUU0U1 dl 


ur.c 
nur 


1 1 8 


1 01 

1 .u 1 


0 Q8 


MM 00979Q Qt 
INIYI UUZ / Ly dL 


uuey 
1 1 1 1 1 , . \ 


0 83 
U.oJ 


1 97 
1 .Z / 


1 1 0 
1 . 1 u 


MM 091QS8 Qt 

iNiivi uziyjo di 


III. A 


1 1 6 
1.10 


1 1 4 
1. 14 


1 06 
1 .UO 


MM 003483 Qt 
INIYI UUj4oj dl 


tiTMn A 9 


0 Q3 


0 Q4 


1 1 9 

1. 1Z 


MM 009 19Q Qt 
1MV1 dL 


inVTf}R9 
niviO-DZ 


9 S3 


3 9A 
J.ZO 


3 46 
J. 40 


MM 000S4S Qt 
INIYI UUUJ4J dl 


WMF 1 A 

riiNir i/\ 


0 79 
U. /Z 


0 87 
U.o / 


0 86 
U.oO 


MM 0004S8 Qt 
INIYI UUU4JO dL 


niN r 1 J3 


1 33 
1 .J j 


0 QQ 

\j.yy 


1 1 1 
1.11 


MM 0004S7 Qt 
INIYI UUU4J / dL 


UMFi A 
niN r 4/\ 


0 86 
U.oO 


0 QA 
u.yo 


1 04 

1 .U4 


MM 0068Q6 Qt 
INIYI UUO07D dL 


1 in Y A 7 


0 QS 


0 Q8 

U.I70 


0 Q7 


MM 00SS9Q Qt 

1MV1 UUJJZ7 dL 


norUz 


1 99 

1 .zz 


1 OS 
1 .UJ 


1 04 

1 .U4 


MM 01940S Qt 
1NI1V1 U1Z4UJ dl 


ll^lvl 1 


0 84 
U.o4 


0 89 
U.oZ 


0 Q7 


MM 009 1 66 Qt 
INIYI UUZIOD dL 


1 1 7 _ 


0 83 
U.oJ 


0 77 
U. / / 


0 68 
U.Oo 


MM 009 1 67 Qt 
1N1V1 UUZ10/ dl 


TF13 
llJj 


9 36 
ZJD 


1 

1 .00 


1 98 
1 .Zo 


MM 000619 Qt 
INIYI UUUOIZ dL 


lurz 


1 03 
1 .Uj 


0 Q8 

U.I70 


0 44 

U.44 


MM 009 1 84 at 
INIYI UUZ104 dL 


TT 6<5T 


1 OS 
1 .Uj 


0 QA 
u.yo 


0 03 
u.y j 


MM 0091 Q1 Qt 
IN 1V1 UUZ 1 y 1 dl 


1 . \ t 1 1 \ 


1 07 
1 .U / 


1 08 
l.Uo 


1 OS 

1 .uj 


MM 009 1 Q9 Qt 
rNivi UUZ 1 7Z dL 


TOUD A 


0 86 
U.oO 


1 39 
1JZ 


1 96 
1 .zo 


MM 009 1 Q3 Qt 

i>iivi uuziyj di 


1 in t loo 


3 1 4 
J. 14 


1 79 
1 . /Z 


1 48 
1 .4o 


MM 00SS38 Qt 
1MV1 UUJJJO dL 




0 f\fs 

U.OO 


0 77 
U. / / 


0 77 
U. / / 


MM 03147Q Qt 

rNlVl UJ14/7 dL 


TMWRF 

1 . \ I 1 I D 1 1 


0 70 

u. / u 


0 67 
u.o / 


0 30 

U. Jl7 


NM_005544_dt 


IRS1 


0.88 


0.99 


1.01 


NM_003749_dt 


IRS2 


1.40 


1.09 


1.09 


NM_002204_dt 


ITGA3 


1.38 


1.19 


1.34 


NM_002205_dt 


ITGA5 


1.13 


0.94 


0.95 


NM_000210_dt 


ITGA6 


1.44 


1.83 


1.76 


NM_033668_at 


ITGB1 


0.83 


0.91 


0.92 



Cancers 2012, 4 



965 



Table S2. Cont. 



D rn kp Caf Tit 


(inn nomit 

vrcllc 11 dill L 


\ i.,.,n vc i \( '\ cm 




Mean VC \ati> IK T 

ivictin r v idle 1 1 v v 


mm 0009 1 3 at 

rNlVl UUUZlj dl 


TT(TRA 
1 1 UD4 


1 04 


1 0^ 
1 .uo 


1 0Q 
1 .\)y 


mm 0009 1 4 at 

INIYI UWWZlt dL 


JAUl 


9 99 
z.zz 


9 0^i 


1 01 


MM 000999 at 


IV 1 1 


9 0^ 


1 ^1 

1 .0 1 


1 40 
1 .Hy 


mm nm 300 at 

1N1V1 UUljUU dl 


l^T FA 


3 Q7 

J.y I 


9 

z.oo 


1 73 
1. 1 J 


mm 004Q8S at 

INIYI UWyoJ dL 




0 09 


0 80 


0 71 
U. / 1 


MM 00997A at 
1N1V1 UUZZ/0 dl 


IV rv 1 1 y 


^ 71 


9 QA 

z.yo 


1 8A 
1.80 


MM 00049A at 
1MV1 UWWHZD dL 


T AM A 9 


A 34 


9 3 1 

Z.J 1 


1 7A 
1 . / O 


MM 000997 at 
1N1V1 UUUZZ / dl 


T AAA A 3 


1 3A 


1 43 
1 .HJ 


1 81 
l.ol 


MM 0099Q0 at 

iNivi uuzzyu dl 


T AAA A A 


119 
1. 1Z 


1 97 
1 .Z / 


1 9A 
1 .ZO 


MM 00SSA0 at 


T A AA A ^ 


1 07 


0 Q8 

U.70 


1 09 
1 -UZ 


MM 0099Q1 at 

iNivi uuzzyi dl 


T AA/TR1 
LA 1V1 L> 1 


3 44 


9 4^ 


1 Q1 

1 .y 1 


MM 0099Q9 at 

INIYI UUZZyZ dL 


T A A/TR9 


1 1 4 


0 03 


0 89 
U.oZ 


MM 000998 at 
rNlVl UUUZZo dL 


T AA/TU'* 


0 8^ 


0 Q3 

U.7J 


0 Q9 
u.yz 


MM 0073 SA at 
rNlvl UU/JJU dL 


T AAAVIA 


0 


1 OA 
1 -WO 


1 OS 

1 .UJ 


MM 0099Q3 at 

INIYI UUZZyj dL 


T A A/TP 1 


1 43 
1 .'HO 


1 S8 


1 AA 
1 .OO 


MM 00SSA9 at 
1N1V1 UUJJOZ dl 


T AA/TP9 


1 07 
1 .U / 


0 QA 

u.yo 


1 01 
1 .u 1 


MM 00A0SQ at 
INIYI UUOUjy dl 


T AAAd 


9 33 
Z.JJ 


1 41 

1 .H 1 


1 30 
1 . jU 


MM 01A9AQ at 
1N1V1 uiozoy dl 




1 1 Q 


1 1 A 
1.10 


1 90 
1 .ZU 


MM 00478Q at 
INIYI UU^+ / 07 dL 


T J 1 Y9 

LI 1 A.Z 


1 1 1 


0 87 


0 77 
U. / / 


MM 003010 at 
1N1V1 UUjU 1 U dl 


1VT AP91^zL 


1 07 
1 .U / 


0 Q7 


0 QS 


MM 004834 at 


A/T APzLl^zL 


0 Q9 


1 9Q 

1 .Ly 


1 33 


MM 001 3 1 S at 
INIYI UUljlJ dL 


AA A P1<T 1 A 


0 87 


1 09 
1 .wz 


1 OA 
1 .uo 


MM 0097^0 at 
1N1V1 UUZ/JU dl 


IVLAx rvo 


0 Q8 
V.yo 


1 00 

1 .uu 


1 0Q 
1 .\)y 


MM 009 3 Q1 at 
INIYI \}\)Z*jy i dL 


1V1 LJ IV 


1 71 
1 . / 1 


9 AQ 

z.oy 


9 49 


MM 00094S at 
INIYI UUUZMO dL 


1V1I2/ 1 


0 7Q 
\). ly 


1 1 9 
1 . 1 z 


1 90 
1 .zy 


MM 00949 1 at 
INIYI UUZH-Zl dL 


A/TA/TP 1 
1V11V1L 1 


1 31 


1 40 
1 .'H-U 


3 07 
J.U / 


MM 00SQ40 at 
rNivi uuj"h-u di 


A/TA/TP 1 1 


1 03 
1 .Uj 


1 07 
1 .U / 


1 0Q 
1 .\)y 


MM 00949A at 
INIYI UUZ^+ZO dl 


A/TA/TP 1 0 

IVllVlL 1 z 


1 1 1 
1.11 


Q 73 


9 1 08 
Z 1 .Uo 


MM 004QCK at 
INIYI UU477J dL 


A/TA/TP 1 A 


0 8S 


0 8^ 


0 09 
u.yz 


MM 009498 at 
1N1V1 UUZ^+Zo dl 


A/TA/TP 1 ^ 


0 73 
U. / J 


0 71 
U. / 1 


0 73 
U. / J 


MM 00SQ41 at 
INIYI UUjy^t 1 dL 


A/TA/TP 1 A 


0 Q9 


0 QQ 


1 01 
1 .u 1 


MM 01 A1 SS at 
INIYI U101JJ dL 


A/TA/TP 1 7 
IVllVlL 1 / 




0 84 


0 8S 
U.oJ 


MM 00949Q at 
1N1V1 uuz^+zy dl 


A/TA/TP 1 Q 


1 48 


1 1 8 
1 . 1 o 


1 91 

1 .Z 1 


MM 004S30 at 
1MV1 UWtJjU dL 


A/TA/TP9 
IVllvlLZ 


S 1 4 
J. It- 


4 S7 


3 03 
j.Uj 


MM 0994A8 at 
INIYI UZZ^+Oo dl 


A/TA/TP9^ 
IvllVLr Zj 


0 Q9 


0 84 
U.ot 


0 Q3 


MM 009493 at 
INIYI UUZ4ZJ dL 


A/TA/TP7 
1V11V1L / 


7 08 


S 04 


S 90 

J.Zy 


MM 004QQ4 at 

1N1V1 UU4774 dL 


A/TA/TP Q 

1V11V1L " 


1 99 
1 .zz 


9 4S 


4 3S 


NM_020998_at 


MST1 


0.93 


0.62 


0.53 


NM_005955_at 


MTF1 


0.89 


0.95 


0.98 


NM_005378_at 


MYCN 


0.95 


1.08 


1.07 


NM_002487_dt 


NDN 


2.15 


1.28 


1.25 


NM_000267_at 


NF1 


0.91 


0.89 


0.92 


NM_003998_at 


NFKB1 


1.28 


1.07 


1.10 



Cancers 2012, 4 



966 



Table S2. Cont. 



D rn kp Caf Tit 




\ I.,.,n VC I \( 'V PIP 


\ [in. n I.V ' 17 ,j ,.1 . , JJCC 


Mean VC \ati> HCC 


mm 009S08 Qt 


MTD1 


1 1 1 
1.11 


0 Q0 
u.yu 


0 8A 
U.oO 


mm 0143A0 Qt 


INlVAZ-o 


0 QQ 

V.yy 


1 1 S 

1 . 1 J 


1 OA 
1 .UO 


MM 094408 Qt 


MOTPT^n 
In \J 1 LnZ 


1 OS 
1 -UJ 


0 89 
U. oZ 


0 8A 
U.oO 


mm 00043 s Qt 

1N1V1 UUUtJJ dl 


MOTr'lT^ 
IN (J l^rl J 


1 08 
1 .US 


1 OA 
1 .UO 


119 
1. 1Z 


mm 003899 Qt 

1N1V1 UUjOZZ dL 


MR S A 9 


0 S7 

U.J / 


0 09 

u.yz 


0 7S 

U. / J 


mm oosoi 1 Qt 

1NI1V1 UUJU1 1 dl 


MB T71 


1 1 1 
1.11 


1 OQ 

1 .uy 


1 08 
1 .Uo 


mm 004SS8 Qt 

1MV1 UU4JJO dL 


MT? TM 
IN IV 1 IN 


0 33 
U.J J 


0 3Q 

U.J7 


0 3A 
U. JO 


MM 0044Q8 Qt 


OMPrT TT1 


1 03 
1 .UJ 


1 01 

1 .u 1 


0 Q9 

u.yz 


mm 0048S9 Qt 

INIVI UUtoJZ dl 


OMPrT TT9 


0 88 
U.oo 


0 Q8 


0 Q3 

u.yj 


mm 090S30 Qt 

1MV1 UZUJjU dL 




1 1 A 
I.IO 


1 1 7 
1.1/ 


1 1 8 
1 . 1 0 


MM 00A1Q1 Qt 
1M 1V1 UUO 1 y 1 dl 


P A 9fl4 


1 OA 

1 .uo 


0 Q3 

u.yj 


1 1 1 
1.11 


MM oos3Q9 Qt 

1MV1 UUJJ^Z dL 


rnr z 


1 0Q 

1 .Vy 


0 Q0 
u.yu 


0 87 
U.o / 


mm ooa9 1 s Qt 

1N1V1 UUOZ 1 o dL 




1 1 1 
1.11 


1 1 1 

1.11 


1 04 

1 .V-r 


MM 1 S 1 S04 at 
rNlVl lolJUH dL 


PT1^3R 1 

^IIVJIVI 


1 99 
1 .zz 


0 87 
U.o / 


0 77 
u. / / 


MM 0097A3 Qt 


DROY1 


0 7A 

u. / 0 


0 fsl 

U.O / 


0 43 


MM 009 89 S Qt 
INIVI UUZoZJ dl 


PTM 
r 1 IN 


9 08 
Z.Uo 


1 SA 
l.JO 


1 37 
1 . J / 


MM 009QS7 Qt 
1MV1 \)\)Ayj I dL 


DVD A 
I\ . \ I\ . \ 


0 S9 

U. JZ 


0 S9 

U. JZ 


0 S3 

U.J J 


MM 019439 Qt 


QFTT1R 1 


0 Q8 


1 1 1 
1.11 


1 1 A 
1.10 


mm 00301 s Qt 

1MV1 UUjUI J dL 


orJvrj 


9 SO 

Z. JU 


1 71 
1 . / 1 


1 41 
1 .H 1 


mm nn^oni Q t 
iNiivi uujyu i di 


CM A Fi9 


1 3A 


1 44 
1 .HH 


1 48 
1 .Ho 


mm oosqo9 Qt 
i>iivi uujyuz di 


cyAm 
0IVLA.UJ 


0 Q0 

u.yu 


0 Q7 

V.y I 


1 00 
1 .UU 


mm oos3sq Qt 

1MV1 UUJJJ7 dL 




0 Q7 

V.y 1 


0 QQ 

V.yy 


0 Q8 

U.70 


mm oosqo3 Qt 

INIVI UUjyUj dl 


cm a ns 


0 87 
U.o / 


0 Q3 

u.y J 


0 QO 

u.yu 


mm oossas Qt 

1MV1 UUJJoJ dL 


^mafia 


1 1 7 
1.1/ 


1 OA 
1 .UO 


0 QQ 

V.yy 


mm oosqo4 at 

1MV1 UUJyU^t dl 




9 Q4 


1 77 
1 . / / 


1 9S 

1 .ZJ 


mm oosqos Qt 

1MV1 UUJyUJ dl 


anno 


0 Q7 

v.y 1 


0 Q8 
V.yo 


0 Q7 

V.y 1 


MM 0994S4 at 

1MV1 WZZH-JH- dL 


CDY1 7 


1 90 
1 .zu 


1 08 
1 .Uo 


0 QA 
u.yo 


mm 00034A Qt 

1NI1V1 UUUJ40 dl 




4 81 
H.o 1 


9 QQ 

L.yy 


3 1 9 
J. 1Z 


mm ooosa9 Qt 

INlYi UUUJOZ dL 


or r 1 


Q 7S 


Q S4 

7. J^+ 


1 A 41 
10.41 


mm 003 1 37 Qt 

INIVI UUJ 1 J / dl 


oKrivl 


0 7A 
U. / 0 


0 Q9 

u.yz 


1 OS 
1 .UJ 


mm 003 1 so Qt 

1MV1 UUjlJU dL 


•5TAT3 
olALj 


0 83 
U.oJ 


0 S7 
U.J / 


0 so 

U. JU 


MM 018934 Qt 
1MV1 UloZjM- dL 


<iTF AP3 


0 S7 

U.J / 


0 S7 

U.J / 


0 34 
U. j^t 


MM 01ASAQ Qt 
INIVI uiojoy dl 


1 DAj 


0 78 
U. / 0 


1 SS 

1 .J J 


1 7A 
1 . / 0 


mm 003900 Qt 

rNlVl UUJZUU dL 


TPF3 


1 03 
1 -UJ 


1 07 
1 .U / 


1 1 0 
1 . 1 u 


mm oooaao Qt 

1NI1V1 UUUOOU dl 


TraR 1 
1 urD 1 


3 1 3 
J. 1 J 


1 QO 
l.yV 


1 7S 

1. / J 


mm 003938 Qt 


1 yjr DZ 


0 QS 

U.yJ 


0 QQ 

V.yy 


0 Q8 

U.70 


MM 003930 Qt 

rNlVl UWJZJ7 dL 


1 v j ldj 


1 00 
1 .uu 


0 Q3 

V.y J 


0 Q7 

v.y 1 


NM_004612_at 


TGFBR1 


0.91 


1.03 


1.08 


NM_003242_at 


TGFBR2 


1.51 


1.39 


1.17 


NM_003243_at 


TGFBR3 


1.19 


0.67 


0.56 


NM_003255_dt 


TIMP2 


1.43 


1.50 


1.33 


NM_003256_at 


TIMP4 


0.92 


0.88 


0.89 


NM_000594_dt 


TNF 


0.90 


0.91 


0.98 



Cancers 2012, 4 



967 



Table S2. Cont. 



D rn kp Caf Tit 




\ I.,.,n VC TTPV PTR 


\ i.,., n vc ft o i-i i; nee 


Mean VC l5»tt» IK T 


MM D1 S^49 cit 
1MV1 dL 


T TPF9 






1 .ZD 


NM 001180 at 


VIM 


5 51 


4 74 


5 16 


NM_000378_at 


WT1 


1.08 


1.09 


1.08 


NM_005080_at 


XBP1 


0.64 


0.86 


0.71 


NM 006106 at 


YAP1 


1.06 


0.96 


0.75 


NM_015642_at 


ZBTB20 


2.03 


1.38 


1.25 


NM_014943_at 


ZHX2 


1.11 


1.02 


1.09 


NM_004773_at 


ZNHIT3 


1.25 


1.40 


1.38 



Table S3. Individual fold-change for BMP inhibitors in HCC samples. 
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1.1 


1.1 


Tl_607 


2.6 


1.6 


1.1 


0.8 


T2_116 


1.7 


17.2 


0.9 


4.3 


T2_184 


1.3 


1.7 


13.7 


1.0 


T2 309 


1.6 


2.5 


1.0 


1.7 


T2_334 


2.1 


3.6 


1.0 


1.9 


T2_342 


0.9 


15.4 


4.6 


3.9 


T2_388 


3.2 


1.9 


27.5 


0.9 


T2_422 


1.0 


1.7 


1.4 


2.8 


T2_451 


2.5 


1.8 


0.9 


1.6 


T2_507 


1.7 


1.3 


1.1 


1.1 


T2_524 


2.1 


2.7 


1.2 


1.3 


T2_550 


1.2 


23.5 


1.4 


3.8 


T2_588 


1.1 


27.7 


2.4 


1.2 


T2_614 


1.1 


1.1 


1.2 


2.1 


T2_657 


1.1 


6.3 


12.7 


1.7 


T2 666 


1.5 


12.4 


1.2 


1.8 


T2_718 


1.1 


1.0 


0.9 


1.8 


T2_728 


1.1 


10.9 


13.0 


1.1 


T2_753 


3.0 


1.5 


2.6 


0.5 


T2_753B 


1.8 


0.8 


1.1 


0.3 


T2_787 


1.2 


2.5 


1.4 


1.7 


T2R2926 


1.4 


41.5 


8.0 


6.3 


T2R2927 


1.4 


1.4 


0.9 


3.8 


T2R2928 


1.3 


1.3 


1.1 


2.9 


T2 R2929 


1.0 


14.3 


0.9 


2.2 


T2_ R3502 


2.0 


1.9 


1.1 


0.8 


T2_VM3 


2.3 


3.0 


0.9 


0.7 


T2_VM4 


2.7 


1.1 


1.5 


0.7 


T3_256 


3.2 


1.4 


0.9 


1.4 


T3_297 


1.5 


2.5 


1.0 


4.4 



Cancers 2012, 4 



968 



Table S3. Cont. 



W O 111 W\l i k 


r31 1 jO 


rpri 


V I I\ I \ .> 1 1 


r o i 


T"? ^9Q 


1 9 

1 -Z 


1 A 9 


1 1 

1 . 1 


3 1 
j. i 


3^8 


1 8 


1 n 


1 n 
1 .u 


1 A 


A1 1 
1 J H- 1 1 


1 1 
l . l 


1 A 

1 .H 


O.J 


n q 


1 J JOH 


1 A 
1 


1 ^ 9 

1 J.Z 


1 n 
1 .u 


A 9 


T"3 ^97 


1 1 
1 . 1 


sn ^ 


1 3 
i .j 


1 8 
l . o 


ta ^?nn 


9 1 
Z. 1 


1 9 n 

1Z.U 


1 a 

1.0 


1 Q 

1 .y 


TA 


n q 


D.y 


^ n 


J.J 


ta Ann 


1 9 

1 .Z 


9 3 
Z.J 


1 i 

1.-5 


n 7 
u. / 


TA 1 
It Jjl 


1 9 
1 -Z 


1 1 
1 . 1 


1 1 
1 . 1 


9 A 

z.o 


TA ^9 


1 n 


1 1 1 
11.1 


n 8 


n 8 


TA 81 n 


1 ^ 


9S 1 

Z J . 1 


1 n 

i .yj 


1 8 
1 . o 


T4_R2858 


1.3 


21.3 


1.1 


4.4 


T4A_VM1 


2.3 


3.0 


1.0 


1.1 


T4B_324 


1.0 


24.5 


0.9 


1.3 


T4B_353 


1.0 


1.2 


11.1 


1.7 


T4B_381 


1.9 


1.6 


1.3 


1.3 


T4B_382 


1.5 


1.9 


1.4 


2.2 
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